AITopics | Western GOM

Collaborating Authors

Western GOM

TRIDENT: Benchmarking LLM Safety in Finance, Medicine, and Law

Hui, Zheng, Dong, Yijiang River, Shareghi, Ehsan, Collier, Nigel

arXiv.org Artificial IntelligenceJul-30-2025

As large language models (LLMs) are increasingly deployed in high-risk domains such as law, finance, and medicine, systematically evaluating their domain-specific safety and compliance becomes critical. While prior work has largely focused on improving LLM performance in these domains, it has often neglected the evaluation of domain-specific safety risks. To bridge this gap, we first define domain-specific safety principles for LLMs based on the AMA Principles of Medical Ethics, the ABA Model Rules of Professional Conduct, and the CFA Institute Code of Ethics. Building on this foundation, we introduce Trident-Bench, a benchmark specifically targeting LLM safety in the legal, financial, and medical domains. We evaluated 19 general-purpose and domain-specialized models on Trident-Bench and show that it effectively reveals key safety gaps -- strong generalist models (e.g., GPT, Gemini) can meet basic expectations, whereas domain-specialized models often struggle with subtle ethical nuances. This highlights an urgent need for finer-grained domain-specific safety improvements. By introducing Trident-Bench, our work provides one of the first systematic resources for studying LLM safety in law and finance, and lays the groundwork for future research aimed at reducing the safety risks of deploying LLMs in professionally regulated fields. Code and benchmark will be released at: https://github.com/zackhuiiiii/TRIDENT

arxiv preprint arxiv, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2507.21134

Country:

Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
South America > Colombia > Meta Department > Villavicencio (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
(6 more...)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.67)

Industry:

Law (1.00)
Health & Medicine (1.00)
Banking & Finance (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

ODYSSEE: Oyster Detection Yielded by Sensor Systems on Edge Electronics

Lin, Xiaomin, Mange, Vivek, Suresh, Arjun, Neuberger, Bernhard, Palnitkar, Aadi, Campbell, Brendan, Williams, Alan, Baxevani, Kleio, Mallette, Jeremy, Vera, Alhim, Vincze, Markus, Rekleitis, Ioannis, Tanner, Herbert G., Aloimonos, Yiannis

arXiv.org Artificial IntelligenceSep-13-2024

Oysters are a vital keystone species in coastal ecosystems, providing significant economic, environmental, and cultural benefits. As the importance of oysters grows, so does the relevance of autonomous systems for their detection and monitoring. However, current monitoring strategies often rely on destructive methods. While manual identification of oysters from video footage is non-destructive, it is time-consuming, requires expert input, and is further complicated by the challenges of the underwater environment. To address these challenges, we propose a novel pipeline using stable diffusion to augment a collected real dataset with realistic synthetic data. This method enhances the dataset used to train a YOLOv10-based vision model. The model is then deployed and tested on an edge platform in underwater robotics, achieving a state-of-the-art 0.657 mAP@50 for oyster detection on the Aqua2 platform.

artificial intelligence, deep learning, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2409.07003

Country:

North America > United States > New Jersey (0.14)
North America > United States > Texas (0.14)
North America > United States > Delaware > New Castle County > Newark (0.14)
(10 more...)

Genre: Research Report (0.64)

Industry: Government > Regional Government > North America Government > United States Government (0.93)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Add feedback

Temporal Inductive Path Neural Network for Temporal Knowledge Graph Reasoning

Dong, Hao, Wang, Pengyang, Xiao, Meng, Ning, Zhiyuan, Wang, Pengfei, Zhou, Yuanchun

arXiv.org Artificial IntelligenceJan-25-2024

Temporal Knowledge Graph (TKG) is an extension of traditional Knowledge Graph (KG) that incorporates the dimension of time. Reasoning on TKGs is a crucial task that aims to predict future facts based on historical occurrences. The key challenge lies in uncovering structural dependencies within historical subgraphs and temporal patterns. Most existing approaches model TKGs relying on entity modeling, as nodes in the graph play a crucial role in knowledge representation. However, the real-world scenario often involves an extensive number of entities, with new entities emerging over time. This makes it challenging for entity-dependent methods to cope with extensive volumes of entities, and effectively handling newly emerging entities also becomes a significant challenge. Therefore, we propose Temporal Inductive Path Neural Network (TiPNN), which models historical information in an entity-independent perspective. Specifically, TiPNN adopts a unified graph, namely history temporal graph, to comprehensively capture and encapsulate information from history. Subsequently, we utilize the defined query-aware temporal paths on a history temporal graph to model historical path information related to queries for reasoning. Extensive experiments illustrate that the proposed model not only attains significant performance enhancements but also handles inductive settings, while additionally facilitating the provision of reasoning evidence through history temporal graphs.

artificial intelligence, machine learning, temporal reasoning, (17 more...)

arXiv.org Artificial Intelligence

2309.03251

Country:

Asia > Thailand (0.05)
Asia > India (0.04)
Europe > United Kingdom (0.04)
(11 more...)

Genre: Research Report > New Finding (0.67)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Temporal Reasoning (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.84)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.84)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.66)

Add feedback

Temporal Action Localization with Enhanced Instant Discriminability

Shi, Dingfeng, Cao, Qiong, Zhong, Yujie, An, Shan, Cheng, Jian, Zhu, Haogang, Tao, Dacheng

arXiv.org Artificial IntelligenceSep-11-2023

Temporal action detection (TAD) aims to detect all action boundaries and their corresponding categories in an untrimmed video. The unclear boundaries of actions in videos often result in imprecise predictions of action boundaries by existing methods. To resolve this issue, we propose a one-stage framework named TriDet. First, we propose a Trident-head to model the action boundary via an estimated relative probability distribution around the boundary. Then, we analyze the rank-loss problem (i.e. instant discriminability deterioration) in transformer-based methods and propose an efficient scalable-granularity perception (SGP) layer to mitigate this issue. To further push the limit of instant discriminability in the video backbone, we leverage the strong representation capability of pretrained large models and investigate their performance on TAD. Last, considering the adequate spatial-temporal context for classification, we design a decoupled feature pyramid network with separate feature pyramids to incorporate rich spatial context from the large model for localization. Experimental results demonstrate the robustness of TriDet and its state-of-the-art performance on multiple TAD datasets, including hierarchical (multilabel) TAD datasets.

localization, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2309.0559

Country:

Asia > China (0.04)
North America > United States > Gulf of Mexico > Western GOM (0.04)
North America > Canada > Alberta (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)
Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.86)

Add feedback

Dynamic programming with partial information to overcome navigational uncertainty in a nautical environment

Beeler, Chris, Li, Xinkai, Crowley, Mark, Fraser, Maia, Tamblyn, Isaac

arXiv.org Artificial IntelligenceDec-29-2021

In an MDP, the state of the system is known, however, Uncertainty creates a major obstacle in solving control in a POMDP it must be estimated, leading to some problems. The goal of these problems is to construct a policy amount of uncertainty. Much of the difficulty in solving that is expected to produce optimal trajectories. In some a POMDP stems from estimating the state of the system cases, uncertainty only causes deviations from the optimal before choosing an action. This is where the majority of trajectory, which may still result in an acceptable solution.

artificial intelligence, machine learning, water current, (16 more...)

arXiv.org Artificial Intelligence

2112.14657

Country:

North America > Canada > Ontario > National Capital Region > Ottawa (0.15)
North America > Canada > Ontario > Waterloo Region > Waterloo (0.14)
North America > United States > Gulf of Mexico > Western GOM (0.05)
(3 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback